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Neural  Network  Grasping  Controller  for  Continuum  Robots 

D.  Braganza,  D.  M.  Dawson,  I.  D.  Walker  and  N.  Nath 


Abstract —  Continuum  or  hyper-redundant  robots  are  robots 
which  exhibit  behavior  similar  to  biological  trunks,  tentacles 
and  snakes.  Unlike  traditional  robots,  continuum  robot  ma¬ 
nipulators  do  not  have  rigid  joints,  hence  the  manipulators 
are  compliant,  extremely  dexterous,  and  capable  of  dynamic, 
adaptive  manipulation  in  unstructured  environments;  however, 
the  development  of  high-performance  control  algorithms  for 
these  manipulators  is  a  challenging  problem.  In  this  paper, 
we  present  an  approach  to  whole  arm  grasping  control  for 
continuum  robots.  The  grasping  controller  is  developed  in  two 
stages;  high  level  path  planning  for  the  grasping  objective,  and 
a  low  level  joint  controller  using  a  neural  network  feedforward 
component  to  compensate  for  dynamic  uncertainties.  These 
techniques  are  used  to  enable  whole  arm  grasping  without 
using  contact  force  measurements  and  without  using  a  dynamic 
model  of  the  continuum  robot.  Experimental  results  using 
the  OCTARM,  a  soft  continuum  robotic  manipulator  are 
included  to  illustrate  the  efficacy  of  the  proposed  low  level 
joint  controller. 

I.  Introduction 

Continuum  or  hyper-redundant  robot  manipulators  are 
manipulators  which  exhibit  behavior  similar  to  biological 
trunks,  tentacles  and  snakes  [1],  [2],  Unlike  traditional  rigid 
link  robots,  continuum  robot  manipulators  do  not  have  rigid 
joints,  also  the  increased  number  of  degrees  of  freedom 
give  the  manipulator  some  very  useful  properties.  The  ma¬ 
nipulators  are  flexible,  compliant,  extremely  dexterous  and 
capable  of  dynamic  adaptive  manipulation  in  unstructured 
environments.  Due  to  these  inherent  properties  of  soft  contin¬ 
uum  robot  manipulators,  they  are  uniquely  suited  to  perform 
whole  arm  grasping.  Whole  arm  manipulation  [3]  is  the  term 
used  to  describe  the  ability  of  the  manipulator  to  grasp  an 
object  using  its  entire  body,  as  compared  to  fingertip  grasping 
performed  by  traditional  robotic  grippers  and  hands.  Whole 
arm  grasping  is  performed  by  allowing  the  robot  manipulator 
to  make  contact  with  the  object  in  a  snake  or  tentacle-like 
manner,  using  portions  of  the  manipulator  itself  to  wrap 
around  the  object  and  grasp  it.  Several  researchers  including 
[3],  [4]  and  others  have  pointed  out  the  advantages  of  whole 
arm  grasping,  some  of  which  are,  increased  load  capacity 
and  the  ability  to  grasp  objects  of  various  dimensions.  These 
capabilities  can  be  used  in  many  applications,  including 
search  and  rescue,  underwater  and  space  exploration.  The 
grasping  techniques  presented  in  most  of  the  previous  work 
requires  either  that  the  geometry  of  the  object  and  the 
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constraint  forces  to  be  known  a  priori  [5],  or  that  the  contact 
forces  be  measurable  using  some  type  of  force  sensor  [6], 

[7] . 

The  development  of  high  performance  model  based  con¬ 
trol  algorithms  for  continuum  manipulators  is  a  challenging 
problem  for  several  reasons.  For  example,  since  the  arms 
must  be  modeled  as  continuous  curves,  the  kinematic  and 
dynamic  models  are  difficult  to  derive.  Also  the  manipulator 
body  is  soft  and  flexible  which  makes  accurate  control 
difficult.  Several  researchers  have  proposed  kinematic  control 
techniques  for  continuum  manipulators,  for  example  see 

[8] ,  and  the  references  therein.  A  set-point  controller  for  a 
variable  length  manipulator  based  on  an  artificial  potential 
function  method  which  does  not  require  the  dynamic  model 
was  presented  in  [9].  Various  other  techniques  have  been 
suggested  for  tracking  control  of  continuum  manipulators. 
In  [10],  sliding  mode  and  impedance  control  techniques 
for  hyper-flexible  manipulators  were  presented.  In  [11],  the 
authors  present  a  trajectory  tracking  control  of  snake  robots 
based  on  its  dynamic  model.  In  [12],  a  fuzzy  control  method 
was  presented.  Shape  tracking  control,  where  the  manipulator 
follows  a  desired  shape  prescribed  by  a  time-varying  spatial 
curve  was  considered  by  [13].  All  of  the  aforementioned 
techniques  but  [9]  require  the  dynamic  model  of  the  manip¬ 
ulator  be  known. 

In  this  paper,  a  path  planner  is  presented  for  whole  arm 
grasping  which  does  not  require  the  constraint  forces  to  be 
known  a  priori  and  also  eliminates  the  requirement  for  con¬ 
tact  force  sensing.  The  path  planner  is  then  fused  with  a  low 
level  joint  controller  that  uses  a  neural  network  feedforward 
component  to  compensate  for  the  unknown  dynamics  of  the 
continuum  robot  manipulator.  Both  the  planner  and  controller 
are  designed  such  that  force  measurements  are  not  required. 
The  design  of  the  neural  network  is  based  on  the  augmented 
back  propagation  algorithm  [14],  Specifically,  the  neural 
network  is  used  to  compensate  for  the  nonlinear  uncertain 
dynamics  of  the  continuum  robot  manipulator  while  a  non¬ 
linear  feedback  controller  [15]  is  used  to  provide  semi-global 
asymptotic  tracking.  The  advantage  of  the  proposed  control 
scheme  compared  to  previous  works  is  that  semi-global 
asymptotic  tracking  can  be  proved,  whereas  most  previous 
results  for  neural  network  control  of  robot  manipulators  [16]- 
[18]  only  prove  ultimate  boundedness  of  the  tracking  error. 

The  remainder  of  the  paper  is  organized  as  follows, 
in  section  II,  the  kinematic  and  dynamic  models  for  the 
continuum  robot  are  presented.  In  section  III,  the  high  level 
path  planning  for  whole  arm  grasping  is  presented.  In  section 
IV,  the  low  level  joint  control  objective  is  defined,  and  the 
design  of  the  low  level  controller  with  the  neural  network 


feedforward  component  is  presented.  To  demonstrate  the 
performance  of  the  proposed  controller  with  the  neural 
network  feedforward  component,  the  controller  was  tested 
on  the  OCTARM,  a  soft  continuum  robotic  manipulator 
for  a  sinusoidal  trajectory.  Experimental  results  both  with 
and  without  the  neural  network  feedforward  component  are 
presented  in  section  V  to  illustrate  the  effectiveness  of  the 
proposed  control  strategy. 


The  subsequent  development  is  based  on  the  following 
assumptions 

Assumption  2:  The  manipulators  joint  position  q(t)  and 
joint  velocity  q(t)  are  measurable. 

Assumption  3:  The  dynamic  terms  denoted  by  M  ( q ) 
and  N{q,q)  are  unknown  nonlinear  functions  of  q(t)  and 
q[t)  which  are  second  order  differentiable  and  satisfy  the 
following  properties 


II.  System  Models 
A.  Kinematic  Model 

The  forward  kinematic  model  for  an  n-segment  continuum 
robot  can  be  developed  as  follows 


Xn  =  fn{q)  (1) 

where  xn{t)  £  Rp  represents  the  robot  end-effector’s  task- 
space  vector,  q{t)  £  Rn  denotes  the  joint  position,  and 
fn(q)  G  Rp  denotes  the  forward  kinematics  of  the  robot. 
Note  for  continuum  robots,  q(t)  is  a  vector  of  curvatures  and 
extensions  of  the  robot  sections,  for  more  information  see 
[19],  The  velocity  kinematics  for  the  robot  can  be  developed 
as  follows 

Xn  =  Jn(q)q(t)  (2) 


where  xn(t )  £  Rp  represents  the  task-space  velocity,  q(t)  £ 

R”  denotes  the  joint  velocity,  and  Jn{q)  =  ^fn{q)  e  jjpxn 

oq 

denotes  the  robot  Jacobian.  Recently  in  [  19],  the  authors  have 
developed  a  general  method  for  determining  the  kinematics 
of  continuum  robots.  This  approach  enables  the  Cartesian 
position  and  orientation  of  the  end  effector  and  the  robot 
Jacobian  to  be  computed  in  real-time. 

Assumption  1:  The  kinematic  model  for  the  continuum 
robot  is  known ,  and  all  kinematic  singularities  are  avoided 
such  that  J{q)  is  always  non-singular. 


B.  Dynamic  Model 

From  a  review  of  the  literature,  it  is  evident  that  there 
have  been  few  published  results  pertaining  to  the  dynamic 
modeling  of  continuum  robot  arms.  Some  of  the  dynamic 
models  which  have  been  developed  include  [20],  [21],  where 
the  planar  model  of  the  manipulator  was  considered,  and 
[22],  where  the  authors  develop  a  3D  dynamic  model  for 
a  constant  length,  non-extensible  continuum  manipulator.  As 
such,  the  complete  dynamic  modeling  of  variable  length  con¬ 
tinuum  robot  arms  remains  an  open  research  area.  In  [22],  the 
developed  dynamic  model  was  shown  to  satisfy  the  familiar 
property  that  the  continuum  manipulators  inertia  matrix  is 
symmetric  and  positive  definite.  In  this  development,  we  will 
assume  that  the  dynamic  model  of  a  9-joint  continuum  robot 
manipulator  can  be  described  by  the  following  expression 

M(q)q  +  N(q,  q)  =  u  (3) 

where  M{q)  £  R9x9  represents  the  inertia  matrix,  N(q,  q)  £ 
R9  represents  the  remaining  dynamic  terms,  u(t)  £  M9 
represents  the  input  torque  vector,  and  q{t),  q(t),  q(t)  £ 
R9  represent  the  joint  position,  velocity  and  acceleration 
respectively. 


G  Coo  ifq(t),q(t),Q(t )  G  A>o  (4) 
N(-),  Nf),  N(.)  £  Coo  if  q{t),  q(t),  q(t),  q(t)  £  Coo •  (5) 
Assumption  4:  The  inertia  matrix  M{q)  is  symmetric  and 
positive-definite,  and  satisfies  the  following  inequalities 

mi  neii2  <  eTM(g)e  <  m2  neii2  ve  e  r9  (6) 

where  mi,  m2  £  R  are  positive  constants,  and  ||j|  denotes 
the  standard  Euclidean  norm. 


III.  High  Level  Path  Planning 

Whole  arm  grasping  can  be  achieved  by  integrating  the 
path  planner  and  the  controller  such  that  two  tasks,  robot 
end-effector  positioning  and  robot  body  self-motion  position¬ 
ing,  are  accomplished  simultaneously  [23].  The  end-effector 
positioning  controller  forces  the  end-effector  to  follow  a  path 
around  the  object  which  in  turn,  forces  the  robot’s  body 
to  wrap  itself  around  the  object  to  be  grasped.  The  body 
self-motion  positioning  controller  “repels”  the  body  of  the 
manipulator  away  from  the  object  while  the  end-effector 
moves  around  the  object. 


A.  Kinematic  Planning 

To  facilitate  the  kinematic  planning,  the  pseudo-inverse 
of  the  manipulator  Jacobian  denoted  by  J+{q)  £  Rraxp  is 
defined  as  follows 

#  =  jT(jnjT)-1  (7) 

where  J+  (q)  satisfies  the  following  equality 


t  7+  —  T 

°n° n  —  Jp 


(8) 


where  Ip  £  Rpxp  is  the  standard  identity  matrix.  As  shown 
in  [24],  the  pseudo-inverse  defined  by  (7)  satisfies  the  Moore  - 
Penrose  conditions  given  below 


7  7+7—7 

un°n  °n 


7+  /  7+  —  7+ 

0 n  ~  °n 


(j+jny  =  jyjn  (JnJ+y  =  jnjy. 


(9) 


In  addition  to  the  above  properties,  the  matrix  (/„ 
satisfies  the  following  useful  properties 


{In  ~  JiJn)  ( In  -  Jn  ^ci)  —  In  Jn  Ici 
{In  ~  J+Jnf  =  {In  ~  J+  Jn) 

Jn  {In  -  1^  Jn)  =  0 
{In  —  J+Jn)  jy  =  o 


jyjn) 


(10) 


where  In  £  Rnxrl  is  the  standard  identity  matrix. 

Based  on  (2)  and  the  above  properties  the  kinematic 
planner,  denoted  by  77(7)  £  R",  that  enables  the  whole  arm 
grasping  objective  is  designed  as  follows 


77(7)  =  J+77e  +  (/„  -  jy  Jn)  Um  (1 1) 


where  Ue  ( t )  £  Rp  is  the  end-effector  positioning  controller, 
and  Umft)  £  R”  is  the  robot  body  self-motion  controller. 
The  objective  of  the  end-effector  positioning  controller  is 
to  force  the  end-effector  to  track  a  desired  trajectory  en¬ 
compassing  the  surface  of  the  object  to  be  grasped.  We  will 
utilize  a  task  space  velocity  field,  [25],  for  the  end-effector 
positioning  because  it  more  effectively  penalizes  the  end- 
effector  for  leaving  the  contour  and  does  not  exhibit  the 
radial  reduction  phenomenon  [25],  [26].  For  example,  when 
the  object  to  be  grasped  is  circular,  the  velocity  field  can 
be  designed  to  generate  a  desired  trajectory  which  forces  the 
end-effector  to  spiral  inwards,  toward,  and  around  the  surface 
of  the  object. 

The  end-effector  positioning  controller  is  designed  [23]  as 

2 

p2(xn,xd)e  (12) 

where  i9  (xn)  £  Rp  is  a  task-space  velocity  field  which 
encircles  the  object  to  be  grasped,  Ke  £  Rpxp  is  a  positive 
definite  diagonal  gain  matrix,  kn  £  R+  is  a  scalar  gain 
parameter,  eft)  £  Rp  is  the  error  between  the  desired  and 
actual  task  space  position  and  is  defined  as  follows 

e  —  Xd  —  xn,  (13) 

where  Xd  (t)  £  Rp  is  the  desired  task-space  position,  and 
xn  ( t )  was  introduced  in  (1).  In  (12),  V(xd)  6  I  is  a  first 
order  differentiable,  nonnegative  function,  and  p  (■)  £  R  is 
a  known  positive  function  that  is  assumed  to  be  bounded 
provided  xn(t)  and  xd(t)  are  bounded.  The  desired  task 
space  velocity  trajectory  is  defined  as 

xdft)=ti(xn)  (14) 

where  i9  (xn)  is  the  velocity  field  generated  by  the  task- 
space  position  xn(t).  Refer  to  [23]  for  more  details  of  this 
development. 

The  objective  of  the  body  self-motion  positioning  con¬ 
troller  is  to  “repel”  the  end-effector  and  body  of  the  ma¬ 
nipulator  away  from  the  object  to  be  grasped,  while  the  end- 
effector  moves  around  the  object.  This  repulsion-like  prop¬ 
erty  facilitates  obstacle  avoidance  and  removes  the  “slack” 
from  the  body  of  the  manipulator  as  the  robot  moves  into 
the  grasping  position.  Following  this  line  of  reasoning,  the 
body  self-motion  positioning  controller  Um(t)  £  Rn  of  (11), 
is  designed  [23]  as  follows 

Um  4  -km  [Js  (/„  -  J+Jn)] T  ya  (15) 

where  km  £  R+  is  a  control  gain,  Js  £  Rlx"  is  a  Jacobian- 
like  vector,  In  £  Rnxn  is  the  standard  identity  matrix,  and 
ya  ft)  £  R  is  an  auxiliary  scalar  signal  encoding  geometric 
information  about  the  object’s  surface  and  it’s  relationship 
to  the  manipulator  joint  positions.  This  type  of  geometric 
encoding  keeps  the  body  of  the  manipulator  away  from  the 
object  during  the  initial  phases  of  grasping. 

For  whole  arm  grasping,  a  specific  auxiliary  signal  ya  (t) 
is  designed  as  follows 

m 

Da  =  y ^hqj  (Xj)  (16) 

i=  1 


Ue  =  d  (xn)  +  Kee  +  kn 


dV(xd) 


dxd 


where  m  is  the  number  of  sections  of  the  redundant  manipu¬ 
lator,  xt  =  [  Xn  Xi2  ■  ■  ■  XiP  ]  £  Rp  is  the  Euclidean- 

space  coordinate  for  the  ith  section,  and  hai  ( x, )  £  R  is  the 
repulsion  function  for  the  ith  section  that  encodes  geometric 
information  about  the  surface  of  the  object  with  respect  to 
the  ith  section’s  Euclidean  position.  The  repulsion  function 
hai  (xj)  is  defined  as  follows 

hai(xi)  =  khiexp(-ai/3f  (xi))  ,  Vi  =  l,  ..,m  (17) 

where  kf,j,  on  £  R+  are  constants,  and  /3,  {xf}  £  R  is 

the  section  specific  geometric  function.  The  function  /3,;  (xi) 
should  be  designed  to  be  positive  when  the  manipulator  is  not 
touching  the  object.  For  example,  given  a  spherical  object  in 
three  dimensional  Euclidean-space,  fy  (xf)  could  be  defined 
as  follows 

Pi  (x»)  =  (x*i  -  Xcl )2  +  fxi2  -  xc2)2  +  (xi3  -  xc3)2  -  r2 


where  xci ,  xc2,  xc3,  rQ  £  R  are  the  Euclidean  coordinates  of 
the  center  of  the  spherical  object  and  its  radius,  respectively. 
For  more  details  of  this  development  refer  to  [23]. 


B.  Fusing  the  Planner  with  the  Controller 

To  fuse  the  high  level  path  planner  with  the  low  level  joint 
controller,  we  use  a  desired  trajectory  generator  which  en¬ 
sures  that  the  desired  trajectories  for  the  manipulator’s  joints 
are  bounded.  The  structure  of  the  desired  trajectory  generator 
is  motivated  by  the  choice  of  the  low  level  controller  (25), 
which  requires  the  desired  trajectory  be  bounded  up  to  its 
fourth  derivative. 

To  ensure  that  the  desired  joint  space  velocity  trajectory 
is  bounded,  we  could  use  the  following  expression 

qd  ft)  =  sat  (U (t))  (18) 


where  Uft)  was  defined  in  (11),  qd(t)  €  R9  are  the 
desired  joint  velocities,  sat(f)  £  R™  is  defined  as  sat(f)  = 
[saf(£i),---  ,sat(fn)]T  ,fn]T  £  1"  where 

sat(fi)  £  R  Vi  =  1  ,•••  ,n  is  the  following  saturation 
function 

frn'iri  if  f  i  f  £ min 

ft  if  ft  P  fn tit/  or  ft  <  ft, tax 

fmax  tf  fi  —  fmax 

where  fmin •  fmax  £  R+  are  constants.  If  (18)  is  used  to 
generate  the  desired  trajectory,  we  cannot  prove  that  the 
desired  joint  trajectory,  qd(t)  £  R9  is  bounded,  so  we  could 
use  the  following  filtering  operation 

qd(s)  =  ^  l+  ^  sat(U{t ))  (19) 


where  s  £  C  is  the  standard  Laplace  variable,  and  e  £  R+ 
is  a  small  constant.  However,  in  the  case  of  (19),  we  cannot 
prove  that  the  higher  order  derivatives  of  qd{t)  will  remain 
bounded.  So  the  desired  trajectory  generated  for  qdft)  is 
modified  further  in  the  final  form  given  by 


%(*)  = 


zsat  ( Uft )) 


(20) 


where  k  £  R+  is  large  constant.  From  (20),  it  is  clear  that 

Qd  ft) ,  qd  (t) ,  qd  (t) ,  qd  ft) ,  and  q  d  ft)  £  Jf-oo- 


IV.  Low  LEVEL  JOINT  CONTROL  OBJECTIVE 

The  low  level  control  objective  is  to  design  a  continuous 
controller  which  provides  asymptotic  tracking  of  the  manip¬ 
ulator  joint  position  and  the  desired  trajectory  in  the  sense 
that 

q(t)  -*•  qd(t)  as  i  — >  oo.  (21) 

To  quantify  the  control  objective,  an  error  signal,  denoted  by 
ei(f)  gl9,  is  defined  as  follows 

ei  =  qd~  Q-  (22) 

Furthermore,  an  auxiliary  tracking  error  signal  e2  (t)  £  K9  is 
defined  as  follows 

e2  =  ei  +  Ajei  (23) 

where  Ai  £  R+  is  a  control  gain.  For  the  closed  loop  error 
system  development,  we  define  a  filtered  tracking  error  signal 
r(t)  £  R9  as  follows 

r  =  e  2  +  A2e2  (24) 

where  A2  £  R+  is  a  control  gain. 

The  dynamic  model  of  the  continuum  robot  is  a  nonlinear 
uncertain  system;  hence,  the  strategy  developed  by  Xian  et. 
al.  [15]  can  be  used  for  the  joint  level  controller.  This  con¬ 
troller  is  chosen  because  it  is  continuous,  it  does  not  require 
the  dynamic  model  of  the  manipulator  or  contact  forces  to  be 
known  and  yet  it  provides  semi-global  asymptotic  tracking. 
Specifically,  the  control  objective  described  in  (21)  can  be 
met  with  the  following  controller  [15] 

u(t)  =  (Ka  +  I)e2(t)  -  (Ka  +  I)e2(t0 )  +  I  f(T)dr 

Jt0 

+  f  (A 2(Ks  +  I)e2(r )  +  / 3sgn(e2(T )))  dr( 25) 
Jt  o 

where  u(t)  £  R9  is  the  control  input  defined  in  (3),  A2  £ 
M+  is  a  control  gain  Ks,  f3  £  R9x9  are  positive  definite 
diagonal  control  gain  matrices,  f(t)  £  R9  is  the  neural 
network  feedforward  component  and  sgn(-)  :  R9  i— >  R9 
denotes  the  vector  signum  function  defined  as  sgn(£)  = 
[sgn(ti),--- ,sgn(£9)]T  =  [£i,  •  •  •  , /s>]T  £  R9-  The 

controller  presented  in  (25),  provides  semi-global  asymp¬ 
totic  convergence  of  the  joint  position  tracking  error,  (i.e. 
||ei(f)||  — >  0  as  £  — >  oo).  For  a  detailed  analysis  of  the 
controller  the  reader  is  referred  to  [15]. 

Remark  1:  The  design  of  the  neural  network  feedforward 
component,  /(f),  is  presented  in  the  subsequent  section.  The 
only  restriction  imposed  on  the  neural  network  component 
by  the  selection  of  the  controller  (25)  is  that  f(t )  £  Coo. 

A.  Neural  Network  Feedforward  Design 

The  neural  network  feedforward  component  /(f)  £  R 9  is 
computed  using  a  two  layer  network  with  15  neurons1.  The 

1  The  number  of  neurons  required  for  the  system  was  determined  exper¬ 
imentally  by  noting  the  performance  achievable  with  a  given  number  of 
neurons  and  increasing  the  number  of  neurons  until  satisfactory  tracking 
performance  was  obtained. 


weights  are  computed  using  a  modified  version  of  the  back 
propagation  algorithm  presented  in  [14].  Given  Remark  1, 
an  important  consideration  regarding  the  design  of  the  neural 
network  feedforward  component  is  that  the  output  from  the 
neural  network  must  always  be  bounded  (i.e.  /(f)  £  Coo)-  To 
this  end  the  neural  network  component  is  defined  as  follows 

/  =  WTd  (vTx^j  .  (26) 

where  W(t)  £  R15x9  and  V(t)  £  R37x15  are  estimated 
weight  matrices,  and  x(t)  £  M3'  is  the  input  vector  to  the 
neural  network  which  is  selected  as 

x=[l,  Qd’  Q'd  ]T  (2?) 

where  fjd(t),  qd(t),  q_d(t ),  Qd( f)  were  previously  defined.  The 
vector  activation  function  d(-)  £  R15  i— >  R15  is  defined  as 
follows 


<t(uj)  =  [cr(wi),  a(u>2),  ■■■  ,  cr(wi5)] 

where  u>  =  [oj\,  ix2,  ■  ■  ■  ,  wi5]T  and  er(s)  :  R  i 

sigmoid  activation  function  defined  as 

1 


cr(s)  = 


1  +  exp(-s) 


(28) 
is  the 

(29) 


The  gradient  of  the  vector  activation  function,  denoted  by 
<7  (•)  £  R15x15  can  be  expressed  in  closed  form  as  follows, 
[14] 

d(u))  =  diag{d(Lo)}[I  —  diag{d(io)}} .  (30) 


If  we  were  to  design  the  weight  update  laws  according  to 
the  augmented  backpropagation  algorithm  [14],  we  would 
use  the  following  update  rule 


W  =  —nF  ||r||  W  —  Fa  (-)VTxrT  +  Fa(-)rT 

V  =  -kG  ||r||  W  +  Gx  {a'T(-)W^T 

where  k  £  R+  is  selected  to  be  a  small  constant,  F  £ 
R15x15,  G  £  R37x37  are  positive  definite  gain  matrices, 
x(t)  is  the  input  vector  defined  in  (27),  and  r(t)  is  the 
filtered  tracking  error  signal  defined  in  (24).  Here,  the  filtered 
tracking  error  signal  r(t)  is  required  in  the  update  laws 
which  requires  the  measurement  of  the  manipulator  joint 
acceleration,  and  hence,  this  is  undesirable.  To  ensure  that 
the  weights  generated  from  this  law  are  bounded,  and  that 
joint  acceleration  measurements  are  not  required,  we  redefine 
the  update  laws  as  follows 

W  =  -awW  +  7ict  (vTx)  sat(e2  +  C)T  (31) 


V  =  —avV  +  72a: 


t  (vTx^  IHsat  (e2  +  /) 


(32) 


where  av,  aw  £  R+  are  small  constants,  71,  72  £  R+  are 
control  gains  which  effect  the  learning  speed,  the  function 
sat(f)  :  R15  1 — ►  R15  was  previously  defined,  and  the 
auxiliary  signal  ((t)  £  R9  is  a  surrogate  (i.e.  a  dirty 
derivative  operation)  for  the  signal  e2(t)  which  is  defined 
as  follows 

(33) 


C  =  _(e2  -  rj) 

£ 


where  e  £  R+  is  a  small  constant,  and  the  signal  77(f)  £  R9 
is  updated  according  to  the  following  expression 

V  =  _(e2  —  v)-  (34) 

e 

From  equations  (26)-(34)  and  the  fact  that  the  input  vector 
to  the  neural  network  is  bounded,  it  is  easy  to  show  that  the 
weight  matrices  W  (t)  and  V  ( t )  are  bounded,  and  hence,  the 
output  from  the  neural  network,  /(f),  is  bounded. 

V.  Experimental  Results 


Actuator  Encoders  End  Plates  String  Encoder  Cable  String  Encoder 


Section  3  Section  2  Section  1 


Fig.  1:  The  OCTARM  V.2  robotic  manipulator. 

To  verify  the  performance  of  the  controller  with  the 
neural  network  feedforward  component,  the  controller  was 
implemented  on  the  OCTARM  continuum  robot  manipulator. 
In  this  section,  we  first  provide  a  description  of  the  OCTARM 
continuum  robot  manipulator,  then  experimental  results  are 
provided  which  demonstrate  the  effectiveness  of  the  neu¬ 
ral  network  feedforward  tracking  controller.  Research  work 
for  the  OCTARM  robotic  manipulators  is  being  conducted 
for  the  Soft  Robot  Manipulators  and  Manipulations  project 
supported  by  the  DARPA  BIODYNOTICS  program.  This 
work  is  a  multidisciplinary  and  multi-institutional  effort,  the 
reader  is  referred  to  [2],  [27],  [28],  for  more  details  of  the 
project.  The  team  members  at  Penn  State  University  perform 
the  mechanical  design  and  construction  of  the  arms  while 
the  team  at  Clemson  University  develops  the  electronics, 
kinematics  and  control  systems. 

A.  Description  of  the  OCTARM  Manipulator 

The  OACTARM  manipulator  [28],  [29]  is  a  biologically 
inspired  soft  robot  manipulator  resembling  a  trunk  or  ten¬ 
tacle.  The  OCTARM  is  significantly  more  versatile  and 
adaptable  than  conventional  robotic  manipulators,  capable 
of  adaptive  and  dynamic  manipulation  in  unstructured  en¬ 
vironments.  To  provide  the  desired  dexterity  the  OCTARM 
is  constructed  with  high  strain  extensor  air  muscles  called 
McKibben  actuators,  which  are  constructed  by  covering  latex 
tubing  with  a  double  helical  weave,  plastic  mesh  sheath  [29]. 
These  actuators  provide  the  large  strength  to  weight  ratio  and 
strain  required  for  soft  robot  manipulators. 

The  OCTARM  is  divided  into  three  sections  with  each 
section  capable  of  two  axis  bending  and  extension  allowing 
nine  total  degrees  of  freedom.  The  arm  is  pneumatically 
actuated  with  a  maximum  pressure  of  130  Psi  through  nine 
pressure  control  valves.  To  provide  two-axis  bending  and  ex¬ 
tension,  three  control  channels  are  selected  for  each  section. 
Six  actuators  are  used  in  each  section  one  and  two  and  three 
actuators  are  used  in  section  three.  The  six  actuator  design 


has  two  actuators  for  each  control  channel  and  results  in 
actuators  located  at  a  larger  radius,  corresponding  to  higher 
stiffness  and  load  capacity.  Three  closely-spaced  actuators 
provide  high  curvature  for  the  distal  section.  To  provide 
torsional  motion,  the  OCTARM  has  been  fitted  with  a  D.C. 
motor  at  the  base.  In  OCTARM  V.2  (see  Figure  1),  the  D.C. 
motor  is  directly  coupled  to  the  end  plate  of  the  base  section 
through  a  reduction  gear  mechanism.  This  arrangement  of 
the  base  motor  limits  the  maximum  rotation  to  1 80  degrees. 

For  closed  loop  control  of  the  OCTARM  manipulator 
accurate  shape  sensing  is  essential.  The  shape  of  the  manip¬ 
ulator  can  be  inferred  by  measuring  the  length  of  each  of  the 
actuators  on  the  OCTARM.  To  measure  the  length  of  each 
actuator,  three  string  encoders  are  used.  Figure  1  shows  the 
three  section  OCTARM  V.2  with  the  string  encoders  attached 
to  the  base  of  section  1  and  optical  encoders  located  at  the 
end  plates  of  section  1  and  2.  The  cables  from  each  of  the 
string  encoders  run  the  entire  length  of  the  arm  through  the 
optical  encoders  at  the  lower  sections,  as  seen  in  Figure  1. 
This  configuration  enables  the  length  of  each  of  the  actuators 
on  the  OCTARM  manipulator  to  be  determined.  To  obtain 
actuator  velocity  measurements,  a  variable  structure  velocity 
observer  is  utilized  (see  [15]). 

The  design  of  these  manipulators  is  constantly  being 
refined  to  provide  stronger  actuators,  additional  sensory 
information,  newer  capabilities  and  eliminate  some  of  the 
problems  with  the  previous  designs.  With  the  OCTARM  V.2, 
due  to  the  arrangement  of  the  string  encoders  at  the  base 
section  and  the  optical  encoders  at  the  end  plates  of  the 
distal  sections,  there  were  a  number  of  protrusions  on  the 
surface  of  the  arm  limiting  its  grasping  capabilities.  Also 
the  air  tubes  for  the  distal  sections  were  coiled  on  the  outer 
surface  of  the  arm,  again  limiting  its  grasping  capabilities. 
Another  problem  faced  with  OCTARM  V.2  was  slippage  of 
the  string  encoder  cable  at  the  distal  sections  which  caused  a 
loss  of  calibration.  To  address  these  issues,  a  new  prototype 
of  the  arm  has  been  developed  called  OCTARM  VI  (see 
Figure  2).  OCTARM  VI  has  also  been  fitted  with  a  rotary 
union  which  has  an  electrical  slip  ring  with  36  electrical 
connections  and  also  provides  12  independent  passages  for 
pneumatic  lines.  This  new  rotary  union  enables  continuous 
rotation  of  the  base  of  the  manipulator.  Shape  sensing  with 
the  string  encoders  has  also  been  reconfigured  in  OCTARM 
VI.  There  are  now  nine  string  encoders  arranged  around  the 
base  section.  New  eyelets  for  guiding  the  encoder  cables 
have  also  been  developed  to  reduce  friction.  In  addition,  the 
electrical  wiring  for  sensors  and  the  air  tubes  for  pneumatic 
channels  have  been  enclosed  inside  the  actuators,  providing 
a  clean  exterior  surface  of  the  arm  for  grasping. 

The  robot  control  system  consists  of  commercial  off-the- 
shelf  Pentium  III  EBX  form-factor  Single  Board  Computer 
(SBC)  with  two  ServoToGo  data  acquisition  boards  which 
provide  analog  and  digital  I/O.  The  computer  runs  the 
QNX  Neutrino  real-time  Operating  System  and  QMotor  [30] 
the  in-house  developed  hard  real-time  control  software  for 
implementation  of  the  control  algorithms.  Data  acquisition 
and  control  implementation  were  performed  at  a  frequency 


of  500  Hz. 


B.  Joint  Trajectory  Tracking  Experiment  Description 

Preliminary  experimental  results  were  obtained  using  the 
OCTARM  V.2  to  demonstrate  the  effectiveness  of  the  neural 
network  feedforward  control.  To  test  the  low  level  con¬ 
troller  with  the  neural  network  component  given  in  (25),  a 
sinusoidal  desired  trajectory  was  selected  for  the  actuator 
lengths.  The  three  actuators  on  a  section  are  120  degrees 
apart  mechanically,  so  the  desired  trajectory  for  each  actuator 
in  a  section  is  shifted  120  degrees  in  phase  from  the  trajectory 
of  the  previous  actuator  in  that  section.  The  trajectory  for 
section  i,  where  i  =  1,2,3  represents  the  three  sections  of 
the  OCTARM,  was  selected  as  follows 


Qdik 


(1  —  exp(— 0.5 1)) 


lo  +  2sin  (  0.06257rf  +  —nk 


V/c  =  1,2,3 


where  qdi  =  [qdn,q<m,qdi3]  G  R3  represents  the  desired 
trajectory  for  the  actuators  on  section  i,  lmini  €  R  represents 
the  minimum  lengths  of  the  actuators  on  section  i  and 
lo  =  7  [cm]  is  the  initial  extension  of  the  actuators  on 
section  i.  The  initial  extension  was  selected  to  keep  the 
operating  pressure  close  to  the  center  of  its  operational  range. 
The  minimum  and  maximum  lengths  of  the  sections  which 
correspond  to  the  minimum  pressure  (0  psi)  and  maximum 
pressure  (130  psi)  respectively  are  physical  limitations  of  the 
actuators  and  were  found  to  be  lmini  =  22.9  [cm],Zmjna  = 
22.4  [cm],  lmin3  —  27.9  [cm],  Imax i  —  35.8  [cm],  lmax2  = 
37.1  [cm],  lmax3  =47.7  [cm]. 

The  system  gains  which  yielded  satisfactory  performance 
were  determined  by  trial  and  error  and  were  as  follows 


Ks  =  diag{l,  1, 1, 1, 1, 1, 1, 1, 1}, 

/?  =  diag{  1, 1, 1, 1, 1, 1, 0.5, 0.5, 0.5}, 
Ai  =  1,  A2  =  1,  71  =  10,  72  =  500, 
av  =  0.001,  aw  =  0.001,  e  =  0.01. 


There  was  no  training  period  utilized  to  determine  the  initial 
values  for  the  weight  matrices  Wit)  and  V(t),  the  matrices 
were  initialized  to  zero. 


C.  Analysis  of  Results 

To  test  the  effectiveness  of  the  neural  network  feedforward 
term,  we  compared  the  controller  in  (25)  with  a  standard  PID 
controller  and  the  controller  given  in  (25)  without  the  neural 
network  component.  To  provide  a  means  to  quantify  the 
performance  of  each  controller,  we  compute  the  following 
measures 

Me  4  f  |e1(r)||2dr  (35) 

Jt  0 


Mu 


A 


||w(r)||2dr 


(36) 


where  Mu(t)  is  a  measure  of  the  energy  expended  by  the 
controller,  and  Me(t)  is  a  measure  of  the  magnitude  of  the 
tracking  error  over  the  period  of  operation  of  the  system. 


TABLE  I:  Comparison  of  Energy  Measures  for  Different 


Controllers 


PID 

u(t)  without  f(t) 

u(t)  with  f(t) 

Me 

2.8357  x 

103 

256.2135 

58.2223 

Mu 

1.3689  x 

106 

2.0459  x  106 

2.069  x  106 

The  performance  of  the  system  was  first  tested  without 
the  neural  network  component.  The  control  gains  for  the 
controller  were  adjusted  till  good  performance  was  obtained. 
Figures  (3,  4,  5)  show  the  actual  and  desired  joint  trajectories, 
joint  tracking  error,  and  the  input  pressure  for  the  controller 
without  the  neural  network  component.  Next,  the  neural 
network  feedforward  was  added  to  the  controller,  and  the 
neural  network  weight  update  law  gains  were  adjusted  till 
best  performance  was  obtained.  Figures  (6,  7,  8)  show  the 
actual  and  desired  joint  trajectories,  joint  tracking  error, 
and  the  input  pressure  for  the  controller  with  the  neural 
network  feedforward  component.  It  can  be  seen  from  Figure 
7  that  the  tracking  error  with  the  neural  network  feedforward 
component  settles  out  to  ±0.5  [cm]. 

To  compare  the  controller  performance  with  and  with¬ 
out  the  neural  network  feedforward  component,  the  energy 
measures  were  computed  for  the  two  configurations.  The 
energy  measures  were  also  computed  for  a  standard  PID 
controller,  these  results  are  presented  to  show  the  perfor¬ 
mance  improvement  obtained  by  using  the  controller  in  (25). 
Table  I,  shows  a  comparison  of  the  performance  for  the  three 
controller  configurations.  It  can  clearly  be  seen  from  Table 
I  that  improved  tracking  performance  is  achieved  by  adding 
the  neural  network  feedforward  to  the  controller. 


Fig.  2:  OCTARM  VI  grasping  a  ball. 


D.  Grasping  Experiment  Description 

The  whole  arm  grasping  experiment  with  the  neural  net¬ 
work  based  joint  controller  will  be  conducted  with  OCTARM 
VI  in  the  upcoming  months.  Figure  2,  shows  the  OCTARM 
VI  grasping  a  circular  object.  The  object  specific  functions 


for  this  planar  application  are  defined  as  follows 

Pi  {Xi)  =  (xi  -  xc)2  +  (yi  -  ycf  -  r2a  Mi  =  1,  ..,6  (37) 

where  Xc  =  [xc,yc]T  £  R2  represents  the  co-ordinates  of 
the  center  of  the  object,  and  ro  £  R  represents  the  object 
radius.  The  task  space  variable  for  each  of  the  three  sections 
and  the  mid-point  of  the  three  sections  are  Xi  =  [xi,yi]T  £ 
R2  Vi  =  1,  ..,6.  The  following  task-space  velocity  field  for 
a  planar,  circular  contour  will  be  utilized  [26] 

Xd  =  0(X6)  =  -2 K(X6)f(X6)  [  ^  Zfy 

+2c<x=>  [  -'4d  ]  <38> 

where  Xd  £  R2  represents  the  desired  task  space  velocity 
for  the  end-effector,  Xq  £  R2  represents  the  end-effector  co¬ 
ordinate,  and  the  functions  /(Xq),  K(Xq),  and  c(Xq)  £  R 
are  defined  as  in  [26]. 

VI.  Conclusion 

We  have  presented  a  neural  network  controller  for  grasping 
control  of  a  continuum  robot  manipulator.  The  feedforward 
neural  network  was  used  to  compensate  for  the  uncertain 
nonlinear  dynamics  of  the  continuum  manipulator,  while 
the  nonlinear  controller  provides  semi-global  asymptotic 
convergence  of  the  joint  position  tracking  error.  Experimental 
results  for  the  OCTARM  soft  robotic  manipulator  operating 
along  a  sinusoidal  trajectory  were  presented.  A  comparison 
of  the  tracking  performance,  both  with  and  without  the  neural 
network  feedforward  component  demonstrates  the  efficacy 
of  the  proposed  neural  network  feedforward  estimation  tech¬ 
nique.  Future  work  will  consist  of  testing  the  whole  arm 
grasping  controller  with  the  neural  network  component  on 
OCTARM  VI. 
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Appendix 

Experimental  Figures 


Section  1  :  Actuator  1  Section  1  :  Actuator  2  Section  1  :  Actuator  3 


Fig.  3:  Actual  and  desired  joint  trajectory  without  neural  network  component,  solid  line  represents  the  actual  joint  trajectory, 

dashed  line  represents  the  desired  joint  trajectory. 
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Fig.  4:  Tracking  error  without  neural  network  component. 
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Fig.  5:  Control  pressure  without  neural  network  component. 


E 

o 


Fig.  6:  Actual  and  desired  joint  trajectory  with  neural  network  component,  solid  line  represents  the  actual  joint  trajectory, 

dashed  line  represents  the  desired  joint  trajectory. 
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Fig.  7:  Tracking  error  with  neural  network  component. 
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Fig.  8:  Control  pressure  with  neural  network  component. 


